Ziwei Liu

Nanyang Technological University

Continual GUI Agents

Jan 29, 2026
Via arXiv

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Jan 29, 2026
Via arXiv

OnlineSI: Taming Large Language Model for Online 3D Understanding and Grounding

Jan 23, 2026
Via arXiv

StableWorld: Towards Stable and Consistent Long Interactive Video Generation

Jan 21, 2026
Via arXiv

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Jan 08, 2026
Via arXiv

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Dec 24, 2025
Via arXiv

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Dec 22, 2025
Via arXiv

Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

Dec 18, 2025
Via arXiv

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Dec 15, 2025
Via arXiv

The Best of the Two Worlds: Harmonizing Semantic and Hash IDs for Sequential Recommendation

Dec 11, 2025
Via arXiv